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A METHOD OF CALCULATING INTERNAL SIGNALS FOR USE IN A MAP 

ALGORITHM 

FIELD OF THE INVENTION 

This invention relates to the method for calculating internal signals used in a MAP algorithm, 
more particularly, but not exclusively, for use in the max-log-MAP (maximum a posteriori) 
algorithm. 

BACKGROUND OF THE INVENTION 

The max-log-MAP decoding algorithm is a simplified version of the MAP decoding 
algorithm, which is also known as the BCJR decoding algorithm. The max-log-MAP is used to 
decode a convolutional encoder. It is one of the more popular soft output decoding methods. 

A main application of the max-log-MAP decoder is in the decoding of turbo codes, which are 
generated using two parallel concatenated convolutional encoders. Two max-log-MAP 
decoders are used as component decoders in the turbo decoder. Today, turbo codes have 
become established as an important method of chaimel coding in the third generation wireless 
mobile commimication technical specification. However, as time progresses and demand for 
high data rate communication increases, the turbo decoder is burdened with the task of 
performing with very high processing speed. Improving the speed of the component decoders 
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of the turbo decoder, i.e. the max-log-MAP decoders, is the key to increasing turbo decoding 
speed. 

The conventional max-log-MAP algorithm will now be described. 

At the input of the max-log-MAP decoder are two kinds of received signals: received 
systematic symbols X and received coded symbols Y. A third possible kind of input signal is a 
lambda X signal from the previous max-log-MAP decoder output if iterative decoding (turbo 
decoding) is applied. These signals are used to compute lambda, or in the case of turbo 
decoding, new lambda values. Several important internal signals of the max-log-MAP are 
defined: a, P, and Y. 

a and P are computed recursively - i.e. the computation of the intemal signal a or p at a 
symbol sequence t needs the value of the intemal signal from the previous, for a, or following, 
for p, symbol sequence (t-1 or t+1). 

There are two types of T values: Y*^ is associated wifli the probability of a systematic symbol 
X being bit 0, while Y^ is associated with the probabiUty of the systematic symbol being bit 1 . 
Y is computed using X, Y and the lambda from the previous max-log-MAP decoder output, 
denoted by Xp. 

Signal Y is calculated according to a trellis diagram. Table 1 gives and example of how YO and 
Yl, given states m, are calculated in an 8-state recursive systematic convolutional (RSC) 
encoder with the following transfer function: [1, gi(D)/go(D)] where go(D)=l+D^+D^ and 
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gi=l+D+D^. In the above polynomials, the Ds describe a set of different Hamming distances 
(with respect to all zeros) that can be obtained from a convolutional encoder. Note that, 1 
actually referes to (Hamming distance of 0). Here, an 8-state RSC encoder is taken as an 
example. In fact, m can be other states, such as 16 states or 32 states, depending on the 
application. 
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Table 1 : Example of Y values for states m 



The a for symbol sequence t and states m (total eight different states) is calculated using a and 
Y at t-1 from two different states (m^ and m*). This is done by first computing an 
unnormalized signal a 

at(m) = max-of-2 { aM(m^) + Y^.i(m^) , at.i(m^) + Y\.i(m^) } (1) 

Using the example of the same RSC encoder used in Table 1, Table 2 gives the values of m° 
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and m corresponding to m, wherein states and are selected arbitrarily from eight 

different states m. 

The initial values ao(m) are set to appropriate constants. Usually, ao(0) is set to 0 and {ao(l), 
ao(2),..., ao(7)} are set to a large negative number, for example, -128. 
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Table 2: Example of m*' and values corresponding to m for a computation 
After all eight states are computed, all a values are normalized by subtracting a constant At. 
at(m) = at(m)-A, (2) 
At is a function of Ot which can be obtained in several ways. Some of them are 

At = max-of-8 { at(m) }m6{0.i 7} (3a) 

At = ( max-of-8 { fit(m) >me(o.i....,7} + min-of-8 { at(m) }me{o,i....,7) ) / 2 (3b) 
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At = at(0) (3c) 

Similarly, p is computed using information from different states (m° and m^) and the symbol 
sequence t+1, and using Y from states m at sequence t. 

First, the unnormalized is computed 

fit(m) = niax-of-2 { Ptf i(m°)+ Y^m), ptfi(m^)+ Y\(m) } (4) 

Table 3 gives the values of m° and m^ corresponding to m, using the same RSC encoder. 

The mitial value fiL+i(0), where L is the frame size, is usually set to 0, while {fiL+i(l), 
gL+i(2),...,gL+i(7)} are usually set to a large negative number, for example, -128. 
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Table 3: Example of and values corresponding to m for P computation 
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After all 8 states are computed, all g values are normalized by subtracting a constant Bt which 
is obtained in the similar way as At 

Pt(m) = fit(m)-Bt (5) 

Lambda is computed using a, p and Y of all states. The mP and m^ states for P are the same as 
given in Table 3. 

Xt= max-of-8 { at(m) + Y\(m) + Pt+i(m^) } me{o,i,...,7} 

- max-of-8 { at(m) + Y^m) + pt+iCm^) } me{o.i,..,j} (6) 

As can be seen from the equations, two or more symbols of a or P caimot be computed in 
parallel, and therefore must be computed in sequence. However, the sequential computation of 
a can run in parallel with the sequential computation of p. Therefore, as a and .p have exactly 
the same complexity and are independent of each other, from this point, the a and p 
computations are referred as a single entity called a/p computations, for convenience sake. 

Lambda computations, however, are not independent of a/p computations. The lambda 
computation for symbol sequence t caimot begin until a and p of symbol sequence t are 
computed. 

Lookmg at the algorithm for computing a/p for symbol sequence t (equations (1) - (5)), it can 
be seen that the algorithm can be divided into four dependent sequential stages. In other 
words, one stage cannot begin until the previous has been completed. For example, to compute 
a for symbol sequence t, the four stages Sl(t) - S4(t) are 
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Sl(t): Compute Y^'t-iCm^ and YS.i(m') using Table 1 
S2(t): Compute at(m) using equation (1) 
S3(t): Compute At using one of the equations (3a), (3b), (3c) 
S4(t): Compute a^m) using equation (2) 

It is possible to implement pipelining by starting S2 for the next symbol sequence t+1 after 
S4(t). This is because Sl(t) is independent of any stage at any symbol sequence. Figure 1 
iUustrates an example of the timeline of such an implementation, with the assumption that the 
length of time taken for the stages SI - S4 follows the proportion of T1:T2:T3:T4 = 
1:1:1.5:0.5. 



In general, the length of time taken to complete the stages given L number of symbols 



is 



T = Tl + L (T2 + T3 + T4) 



(7) 



Using the prior art, the pipelining implementation is limited by the normalization computation 
(S3 and S4). In other words, part of the computation (referring to S2) of symbol sequence t+1 
can begin only after the normalization computation in symbol sequence t is completed. 

One way to improve this is to forgo the normalization computation for a few symbols 
(normalization can never be completely eliminated because it is needed to prevent overflow), 
but this would produce different results and slightly increase the number of bits which are set 
aside for the internal signals. 
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It is an object of the invention to provide a method of calculating decoding signals that can 
improve the pipelining implementation. 

. SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided a method of calculating intemal 
signals for use in a MAP algorithm, comprising the steps of: obtaining first decoding signals 
by processing received systematic and received encoded symbols of each symbol sequence of 
a received signal; obtaining vmnormalized second decoding signals for the current symbol 
sequence by processing the first decoding signals of the previous sequence and second 
decoding signals of the previous sequence; obtaining imnormalized third decoding signals for 
the current symbol sequence by processing the first decoding signals of the current sequence 
and third decoding signals of the next sequence; normalizing the unnormalized second and 
third decoding signals; and wherein at least one of said second decoding signals of the 
previous sequence and said third decoding signals of the next sequence are imnormalised. 

Preferably, said fu-st decoding signals are of two types, one type being associated with the 
probability of a said systematic symbol bemg 0 and the other type being associated with the 
probability of a said systematic symbol being 1 . 

Preferably, the step of obtaining current imnormalized second decoding signals is 
implemented by: 

at(m) = max-of-2 { aM(m^) + YVi(m^ , at.i(m^) + Y^.l(m^) } - Am 
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where at(m) are said current unnormalized second decoding signals for states m, aM(m^) and at- 
i(m^) are respectively said prior unnormalized second decoding signals, for states and m^ 
YVi(ni^) and Y\.i(m^) are respectively said two types of said prior first decoding signals for 
states and m\ At-i is a second decoding signal constant for a previous time period, wherein 
said states roP and m^ are selected from said states m. 

Preferably, the step of normalizing said second decoding signal comprises the step of 

calculating: 

at(m) = at(m) - At 

where, at(m) are said current second decoding signals for states m, at(m) are said 
unnormalized second decoding signals for states m, At is a second decoding signal constant for 
the current period. 

Preferably, the step of obtaining current unnormalized second decoding signals is 
implemented by: 

at(m) = max.of-2 { Ot-iCm*') + TVi(m°) , at.i(mb + T\.i(m^) } 

where at(m) are said current unnormalized second decoding signals for states m, at.i(m^ and at- 
i(m^) are respectively said prior unnormalized second decoding signals, for states m^ and m^ 
Y^.i(m°) and Y\.i(m^) are respectively said two types of said prior first decoding signals for 
states m^ and m^ wherein said states m^ and m^ are selected from said states m. 

Preferably, said step of normalizing said current second decoding signal comprises the step of 
calculating: 
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at(m) = at(m) - At - (Ai + A2 + ... + Am) 

where, at(m) is said current second decoding signals for states m, at(m) is said unnormalized 
second decoding signals for states m, and Ai, A2, Am and At are respectively second 
decoding signal constants for the first to current periods. 

Preferably, said second decoding signal constant for the current period At is one of: 
At = max-of-all states { at(m) } 

At = ( max-of-all states { at(m) } + min-of-all states { at(m) }) / 2 
At = at(0) 

where at(m) is said unnormalized second decoding signals for states m. 

The method of the present invention preferably further comprises a step of setting the initial 
values of said unnormalized second decoding signals to selected constants. 

Preferably, the step of obtaining said unnormalized third decoding signals is implemented by: 
gt(m) = max-of-2 { gt+i(mV ^Vm), fitf i(m^)+ T\(m) } - Bt-i 

where gt(nx) are the current imnormalized third decoding signals for states m, fit+i(ni^)) and 
gt+i(m^) are respectively two values of the future unnormalized third decoding signals for 
states m^ and states m^ Y^(m) and T\(m) are respectively said two types of said current first 
decoding signals for states m, Bt-i is a third decoding signal constant for a prior period and 
said states m^ and m^ are selected from said states m. 

Preferably, said step of normalizing said third decoding signals is implemented by calculating 
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Pt(m) = fit(m)-Bt 

where, Pt(m) are the current third decoding signals for states m, gt(ni) are the current 
unnormalized third decoding signals for states m, and Bt is the current third decoding signal 
constant. 4 

Preferably, the step of obtaining said unnormalized third decoding signals is implemented by: 
fit(m) = max-of-2 { fit+i(m^)+ r\(m), fit+i(m^)+ Y^(m) } 

where gt(m) are the current unnormalized tWrd decoding signals for states m, fit+i(m^) and 
gt+i(m^) are respectively two values of the future unnormalized third decoding signal for 
states mO and ml, Y\(m) and T\(m) are respectively said two types of the current first 
decoding signals, wherein said states m^ and m^ are selected from said states m. 

Said step of normalizing said third decoding signal may comprise the step of calculating 
pt(m) = fit(m) - Bt - (Bi + B2 + ... + Bm) 

where, pt(m) are the current third decoding signal for states m, gt(m) are unnormalized third 
decoding signal for states m, and Bi, B2, Bm and Bt are respectively third decodmg signal 
constants for the first to current periods; 

The third decoding signal constant Bt for the current period may be : 
Bt = max-of-all states { fit(ni) } 

Bt = ( max-of-all states { fit(m) } + min-of-all states { gt(m) }) / 2 



Bt = fit(0) 
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Where fit(ni) are the current unnormalized third decoding signals for states m. 

The method of the present invention preferably further comprises a step of setting the initial 
values of said xmnormalized third decoding signals to selected constants. 

Preferably, both of said second decoding signals of the previous sequence and said third 
decoding signals of the next sequence are unnormalised. 

Preferably, the calculation of the internal signals is pipelined whereby the calculation for the 
next symbol sequence is commenced once the xmnormalized signals for the current symbol 
sequence have been calculated. 

The described embodiment of the invention is a modification of the calculations of decoding 
signals a and/or p in algorithm to allow for an improved pipelining implementation and 
allows S2(t-M) to be independent of the normalization in t, and can begin computation as soon 
as S2(t) is completed. The described embodiment also retains the same final values of the 
internal signals (the values after S4) as the prior art. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Embodiments of the invention will now be described, by way of example, with reference to 
the accompanying drawings, in which: 
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Fig. 1 illustrates a prior art timing diagram of the four stages of a/p computation for three 
sjonbols in a pipelining implementation. 

Fig. 2 illustrates a timing diagram of the four stages of a/p computation for three symbols in a 
pipelining implementation according to a first embodiment of the invention. 

Fig. 3 illustrates a timing diagram of the four stages of o/p computation for three symbols in a 
pipelining implementation according to a second embodiment of the invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS OF THE 
INVENTION 

The embodiments of the invention use modified max-log-MAP algorithms and a first 
embodiment will now be described. 

In the modified algorithm, the computations for a/p are modified so that pipelining 
implementation can be faster than that of the prior art. The main feature is to make the 
computation of a/g of symbol sequence t+1 independent of the normalization of a/fi of symbol 
sequence t. 

Equation (1) is modified so that at(m) is computed using xmnormalized values at-i(m) instead 

of normalized values aM(m), and a normalized constant Am is added. 

at(m) = max-of.2 { at.i(m^ -f YVi(m^) , at.i(m^) + Y\.x(m^) } - A t-i (8) 

Equation (2) remains the same. 
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at(m) = at(m) — At 



(9) 



Similarly, equations (4) and (5) are modified to 



fit(m) = max-of-2 { fit+i(mV Y"t(m), fitf i(m V r\(m) } - B, 



't-i 



(10) 



pt(m) = fit(m)-Bt 



(11) 



The final values of Ot and Pt are still exactly the same as the prior art. 

Following these modifications, the four dependent sequential stages described in the prior art 
are now slightly different. For example, to compute a for symbol sequence t, the four stages of 
the modified equations Slm(t) - S4m(t) are 

Slm(t): Compute TVi(m^) and Y\.i(m^) using Table 1 

S2m(t): Compute at(m) using equation (8) (At-i being known firom the previous iteration) 
S3m(t): Compute At using one of the equations (3a), (3b), (3c) 
S4m(t): Compute at(m) using equation (9) 

Compared with the prior art, the length of time taken for these modified stages is almost the 
same. SI and Sim take exactly the same computation time, and so do the pair S3 and S3m and 
the pair S4 and S4m. The length of time taken for stage S2m is slightly increased compared 
with S2, because S2m fiirther subtracts the constant Am- 



As in the prior art, it is possible to implement pipelining by starting S2 for the next symbol 
sequence t+l after S4(t). This is because Sl(t) is independent of any stage at any symbol 
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sequence. Figxire 1 illustrates an example of the timeline of such an implementation, with the 
assumption that the length of time taken for the stages S 1 - S4 follows the ratio 1:1:1 .5:0.5. 

However, the pipelining implementation can now be improved because S2m(t+1) can begin 
after S2m(t). This is because the normalization computations have been deferred. Figure 2 
illustrates an example of the timeline of the improved pipelining implementation according to 
the first embodiment of the present invention, with the assumption that the length of time 
taken for the stages Sim - S4m follows the proportion of Tlm:T2m:T3m:T4m = 1:1.5:1.5:0.5, 
which is the same as the assumption given in tiie prior art description except for the second 
stage. 

Stage 2 can start immediately after the end of the previous stage 2. Therefore, in general, the 
length of time taken to complete the stages given L nxmiber of symbols is 

Tm = T 1 m + T2m + (T3m + T4m) + (L - 1 ) x (T2m + T3m + T4m - T3 m - T4m) 

= Tlm + LxT2m + T3m + T4m (12) 

By comparing witfi the prior art, we can substitute Tl = Tim, T2 = T2m/L5, T3 = T3m, and 
T4 = T4m, and obtain 

T = Tim + L (T2m/1.5 + T3m + T4m) (13) 



The number of times of improvement in speed compared with the prior art is 



T/Tm = 
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(Tlm + L(T2ni/1.5+T3m + T4m))/(Tlm + LxT2m + T3m + T4m) 



(14) 



For there to be an improvement in speed, the ratio T / Tm must be more than one. Further 
simplification of T / Tm > 1 yields 

L > 1 + T2m / (3 X T3m + 3 x T4m - T2m) ( 1 5) 

It can be seen from equation (15) that there will be an improvement in speed as long as the 
frame size is not one, as T2m / (3 x T3m + 3 x T4m - T2m) is likely to be less than one. 
Practically, this means that there will be an improvement at any frame size. 

Usmg the example of Tlm:T2m:T3m:T4m = 1:1.5:1.5:0.5 in equation (14), the number of 
times of improvement in speed approaches 2 for large L. 

However, the improvement should be smaller in the case where S3 is computed using a more 
complex method such as in equations (3a) and (3b), because a relatively small T3 produces a 
small T / Tm ratio. 

In a second embodiment, the algorithm described as the first embodiment can be slightly 
modified and still retain the properties of independent normalization and same final values as 
now described. 

Equation (8) and (9) can be rearranged as 



at(m) = max-of-2 { at.i(m^ + YVi(ni^ , at.i(m*) + Y\.i(m^) } 



(16) 
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and 

at(m) = at(m) - At - (Ai + A2 + ... + Am) (17) 

This shortens the procedure in (8) and makes the procedure in (9) longer, but the effect is the 
same. 

Similarly, equations (10) and (11) can be rearranged as 

fit(m) = max.of-2 { fit+i(m^H- Y^m), fit+i(m^)+ r\(m) } (1 8) 

pt(m) = fit(m) - Bt - (Bi + B2 + ... + Bm) (1 9) 

The stages Sim and S3m are still the same, but S2m and S4m are slightly different. For 
example, for a computation of second embodiment, the four stages are 

Slm2(t): Compute YVi(m^) and Y\.i(m^) usmg Table 1 

S2m2(t): Compute at(m) using equation (16) 

S3m2(t): Compute At using one of the equations (3a), (3b), (3c) 

S4m2(t): Compute at(m) using equation (17), and store (Ai + A2 + ... + At) for future use (in 
the next symbol sequence t+1) 

Compared with the prior art, the length of time taken for these modified stages is almost the 
same. SI and Slm2 take exactly the same computation time, and so do the pair S2 and S2m 
and the pair S3 and S3m. S4m2 is only two times longer than S4 because instead of subtracting 
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one constant, S4m2 subtracts two constants (the second one is the stored value from the 
previous S4m2 symbol). 

In general, the length of time taken to complete the stages given L number of symbols is 

Tm = Tlm + T2m + T3m + LxT4m (20) 

By comparing with the prior art, we can substitute Tl = Tim, T2 = T2m, T3 = T3m, and T4 = 
T4m/2, and obtain 

T = Tim 4- L (T2m + T3m + T4m/2) (21) 

The number of times of improvement in speed compared with tiie prior art is 

T/Tm = (Tlm + L(T2m + T3m + T4m/2))/(Tlm + T2m + T3m + LxT4m) (22) 

For there to be an improvement in speed, the ratio T / Tm must be more than one. Further 
simplification of T / Tm > 1 yields 

L> H-(T2m/2)/(T2m + T3m-T4ni/2) (23) 



It can be seen from equation (23) fliat there will be an improvement in speed as long as the 
frame size, L, is not one, as the ratio (T2m/2) / (T2m + T3m - T4m/2) is likely to be less than 
one. Practically, this means that there will be an improvement at any frame size. 
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Figure 3 illustrates an example of the timeline of the improved pipelining implementation 
according to the second embodiment of the present invention. In Fig. 3, it is assumed that the 
length of time taken for the stages Sim - S4m follows the proportion of Tlm:T2m:T3m:T4m 
= 1:1:1.5:1. Using the above proportion in equation (22), the number of times of improvement 
in speed approaches 3 for large L. 

However, the improvement should be larger in the case where S3 is computed using a more 
complex method such as in equations (3a) and (3b), because a relatively large T3 produces a 
large T / Tm ratio. 

This algorithm may still significantly faster than the prior art. 

The modification of a/p computation in the max-log-MAP algorithm as explained in the first 
and second embodiments can also be applied to a BCJR (fuU-MAP) algorithm, or a log-MAP 
algorithm. Furthermore, the algorithm may be applied to the calculation of a or p or both. 

In summary, a method for calcixlating decoding signals in a MAP algorithm, such as max-log- 
MAP algorithm, has been disclosed. The above-described embodiriients of the invention are 
intended to be illustrative only. Numerous altemative embodiments may be devised by those 
skilled in the art without departing firom the scope of the following claims. 



